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Abstract 

The aim of this paper is twofold: to provide a theoretical framework and to give further 
empirical support to Shiller's test of the appropriateness of prices in the stock market based 
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on the Cyclically Adjusted Price Earnings (CAPE) ratio. We devote the first part of the 

in 

paper to the empirical analysis and we show that the CAPE is a powerful predictor of 
^ future long run performances of the market not only for the U.S. but also for countries such 

us Belgium, France, Germany, Japan, the Netherlands, Norway, Sweden and Switzerland. 

•l-H 

^ We show four relevant empirical facts: i) the striking ability of the logarithmic averaged 

H 

earning over price ratio to predict returns of the index, with an R squared which increases 
with the time horizon, ii) how this evidence increases switching from returns to gross 
returns, iii) moving over different time horizons, the regression coefficients are constant 
in a statistically robust way, and iv) the poorness of the prediction when the precursor 
is adjusted with long term interest rate. In the second part we provide a theoretical 
justification of the empirical observations. Indeed we propose a simple model of the price 
dynamics in which the return growth depends on three components: a) a momentum 
component, naturally justified in terms of agents' belief that expected returns are higher in 
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bullish markets than in bearish ones; b) a fundamental component proportional to the log 
earnings over price ratio at time zero. The initial value of the ratio determines the reference 
growth level, from which the actual stock price may deviate as an effect of random external 
disturbances, and c) a driving component ensuring the diffusive behaviour of stock prices. 
Under these assumptions, we are able to prove that, if we consider a sufficiently large 
number of periods, the expected rate of return and the expected gross return are linear in 
the initial time value of the log earnings over price ratio, and their variance goes to zero 
with rate of convergence equal to minus one. Ultimately this means that, in our model, 
the stock prices dynamics may generate bubbles and crashes in the short and medium 
run, whereas for future long-term returns the valuation ratio remains a good predictor. 

JEL codes: G12, G17 
Keywords: Valuation Ratios, Long Run Stock Market Returns 

1 Introduction 

A question that has been asked with a certain regularity since the establishment of the first 
stock index is the following, see Myers and Swaminathan (1999): is it possible to calculate "the 
Intrinsic Value of the Dow"? Or, more generally: what is the intrinsic value of Wall Street? A 
first commonsense answer is that this value may be calculated by summing "the intrinsic value" 
of each quoted company. At the beginning of the past century, see Williams (1938), Graham 
and Meredith (1998), it was well known that the latter depends on the expected future earnings 
and on the balance sheet, but unfortunately there was no easy formula, which yielded even a 
rough approximation. Therefore aggregating all expectations for all companies quoted in the 
market soon appeared to be a challenging task. 

Between the Fifties and the Sixties the researchers' attitude towards the problem changed 
radically. Modern portfolio theory and the statistical evidence that stock prices follow ran- 
dom walks, see Mandelbrot (1965), Fama (1965), has led to the Efficient Market Hypothesis, 
see Samuelson (1965), Fama (1970). According to this theory, financial markets are "informa- 
tionally efficient", that is, one can not achieve returns in excess of average market returns on 
a risk-adjusted basis, given the information available at the time the investment is made. If 
this were the case "the intrinsic value" of a stock would be simply its market price and the 
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problem would ultimately be non existent. The bubble burst in 1987 and the exceptional price 
boom in the late '90 lead more and more researcher to cast doubts on the hypothesis' truth. 
At the very beginning of 2000 Robert Shiller wrote: "we do not know whether the market level 
makes any sense, or whether they are indeed the result of some human tendency that might be 
called irrational exuberance", Shiller (2000), which is apparently at odds with the assumption 
that market prices are always fair prices. Shiller's skeptical attitude was anticipated in one of 
the most famous books ever written on the stock market: in 1934, Graham and Dodd strongly 
advocated the fundamental approach to investment valuation and recommended to "shift (s) 
the original point of departure, or basis of computation, from the current earnings to the av- 
erage earnings, which should cover a period of not less than five years, and preferably seven 
to ten years" Graham et al. (1962). Since 1988 Campbell and Shiller have been focusing their 
attention on average earnings as predictor of future dividends 1 and future stock prices 2 ; Shiller 
(2000) proposed an innovative test of the appropriateness of prices in the stock market: the 
Cyclically Adjusted Price Earning ratio, and showed that it is a powerful predictor of future 
long run performances of the market; the performance of the test is quite satisfactory in the 
case of the US market from the end of 19 th century up to today. In the same years, again 
elaborating on Graham et al. (1962), Lander et al. (1997) suggested an alternative but similar 
criterion based on the assumption "that many investors are constantly making a choice between 
stock and bond purchases; as the yield on bonds advances, they would be expected to demand 
a correspondingly higher return on stocks, and conversely as bond yields decline Graham et al. 
(1962, pag. 510)"; their "approach considers whether stocks are appropriately valued relative 
to analysts' perceptions of future earnings and yields on alternative investments" such as high 
grade corporate bonds (for a comparison between the two approaches see Weigand and Irons 
(2006)). 

A first goal of this paper is to provide further empirical support to the Shiller test by 
considering the dependence of future returns on the valuation of the stock index at the time of 
investing in several countries different from the United States (more precisely we will investigate 

1 "Our results indicate that a long moving average of real earnings helps to forecast future real dividends. 
The ratio of this earnings variable to the current stock price is a powerful predictor of the return on stock, 
particularly when the return is measured over several years", see Campbell and Shiller (1988a). 

2 The price-smoothed earnings ratio has little ability to predict future growth in smoothed earnings; the R 
squared statistics are less than 4% over one year and over ten years. The ratio is a good forecaster of ten-year 
growth in stock prices, with an R squared statistic of 37%, see Campbell and Shiller (1998). 
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the relationship for Australia, Belgium, Canada, France, Germany, Japan, the Netherlands, 
Norway, Sweden, Switzerland and the United Kingdom). For all these countries, even though 
the historical series are much shorter than those available for the US, we are able to perform 
an exhaustive analysis of the relationship between CAPE and long-term index returns. The 
empirical analysis confirms the ability of the CAPE to predict future long run performances 
of the market also in the case of these countries. The evidence is robust: a regression analysis 
shows that the logarithmic averaged earning over price ratio predicts the future returns of the 
index, with an R squared which is steadily increasing the longer the future time horizon is. 
Indeed this evidence increases when switching from returns to gross returns, and moving over 
different time horizons, the regression coefficients are constant in a statistically robust way. We 
also remark how attempting to adjust the precursor with long term interest rate, as suggested 
by the so-called Fed model, negatively affects the accuracy of the prediction. 

A second goal of this paper is to provide a theoretical framework which explains how future 
stock index returns depend on the Cyclically Adjusted Price Earnings (CAPE) ratio. This is 
achieved by constructing a model. 

What we require of our model is similar to what Chiarella required in Chiarclla (1992, see 
pag. 102) 

"Firstly we require that the model generate a significant transitory component around the 
equilibrium which reflects the rationally expected value of the asset. Secondly the model must 
allow for the incorporation of chartists, a group which bases its market actions on an analysis of 
past trends. Since this group seems to be an important part of real markets it is important to 
determine what effect its activity has on the behavior of asset prices and whether the behavior 
of a model incorporating chartists comes closer to explaining some of the empirical results cited 
earlier" . 

We choose, however, a different approach and we work at an aggregate level instead of 
explicitly modelling two different types of agents. We introduce a moment component in a 
"simple error-correction model that predicts the return of the S&P based on the deviations 
from a presumed equilibrium between . . . earnings yields" Lander et al. (1997, see pag. 3) and 
a long run target yield rate. The return growth depends on three components 

a) a momentum component, naturally justified in terms of agents' expectation that expected 
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returns are higher in bullish markets than in bearish ones; 

b) a fundamental component, proportional to a function of the level of the logarithmic 
averaged earnings over price ratio (for brevity log EP ratio) at time zero; the initial time 
value of log EP ratio determines the reference growth level, from which the actual stock 
price may deviate as an effect of random external disturbances; 

c) a driving component ensuring the diffusive behavior of stock prices. 

Under these assumptions, we are able to prove that, if we consider a sufficiently large number 
of periods, the expected rate of return and the expected gross return are linear in the initial 
time value of log EP, and their variance converges to zero with rate of convergence equal to 
minus one. This means that, in our model, the stock prices dynamics may generate bubbles 
and crashes in the short and medium run, nevertheless the averaged earnings price ratio is a 
good predictor of future long-run returns, as claimed by Campbell and Shiller (1988a), Shiller 
(2000), Lander et al. (1997). 

2 Data set and empirical facts 

The data set analyzed in this paper consists of records on a monthly basis of prices, earnings, 
and dividends for stock price indexes from twelve developed countries. In more detail, for the 
US market we consider the Standard and Poor Composite Stock Price Index data discussed 
in Campbell and Shiller (1987, 1988a, b), and freely available from Robert J. Shiller's webpage 
http://www. econ.yale. edu/~ 'shiller/. These time series cover the entire period from January 1871 
until March 2011. For Australia, Belgium, Canada, France, Germany, Japan, the Netherlands, 
Norway, Sweden, Switzerland, and United Kingdom we resort to MSCI Indexes, MSCI Dividend 
Yields, and MSCI Price over Earning Ratios, kindly provided by Factset 3 . At variance with US 
data, these series span a narrower time window running from December 1969 to December 2010. 
All nominal series are in local currencies and have been deflated using Consumer Price Indexes 
available at the Federal Reserve Economic Data repository http://research.stlouisfed.org/fred2/. 
In the very early part of our work we also wish to study excess index returns over long term 
3 http:/ / www. factset. com/. 
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debt for the US market. The interest rate we use is the 10- Year Treasury Constant Maturity 
Rate (GS10). 

The real price of the stock index, measured at the beginning of time period t, will be written 
P t , while for the real dividend paid on the portfolio during the period between t and t + 1 we 
will use D t . Accordingly, the realized log gross return on the index, held from the beginning of 
time t and the beginning of time t + 1 is written 

H t = ]og (P t+ i+A)-logP*. 

Since data are recorded every month, we naturally refer the notation t + 1 to the time instant 
t increased by one month. Accordingly, the realized gross yield on a monthly 1 basis over h 
periods is 

j h-l 

Vt.h = j^Yl H t+i ■ W 

i=0 

We also introduce the log return on the index X t = logPf+i — logPt, in terms of which the 
gross yield can be rewritten 

w , k 4£x t+i +IX> g (l + i^), (2) 

i=0 i=0 v t+i+t/ 

where the first term on the right hand side readily reduces to (log Pt+h — logP t )//i. 

In the first part of our empirical work we regress yields on some explanatory variables that 

are known at the start of month t. We consider the following variables: the log earnings - 

price ratio log (e)J° — logP t , and the log earnings - price ratio centered on the prevailing long 

term interest rate log(e) 10 — log Pi — logr t . The quantity (e)J° refers to a moving average of 

real earnings over a time window of ten years. At variance with Campbell and Shiller (1988b), 

where the analysis is based on geometrical averages, we consider here arithmetical ones. The 

very reason is that in this way the current study can be readily adapted to the case of negative 

earnings. While an average over ten years resulting in a negative value is quite an unrealistic 

situation, it is not so uncommon to deal with spot realized earnings which are in fact losses. 

In this latter case the definition of loge* is problematic. However, for the data set under 
4 Throughout this paper annual yields are obtained multiplying y t by a factor 12. 
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consideration here this is never the case and we expect that moving from one average to the 
other does not change the overall picture. The use of an average of earnings in computing the 
ratios has been strongly pushed by the literature in recognition of the cyclical variability of 
earnings. In Graham et al. (1962) an approach is recommended that "shifts the original point 
of departure, or basis of computation, from the current earnings to the average earnings, which 
should cover a period of not less than five years, and preferably seven to ten years." 

In figures 1, 2, and 3 the solid lines represent regressions for the period 1871-2010, truncated 
where required in order to compute average over earnings and multi period returns. The solid 
points correspond to empirical data, while the role of dashed lines will be clarified later. In 
table 1 returns are measured over two until sixteen years (with the exclusion of odd years), as 
reported in the first column on a monthly basis. The second, third and fourth column give 
results of the regression of (log P t +h — log Pt) /h on log (e) j° — log P t — log r t and log (e) ]° — log P t , 
and of y t; h on log (e)J° — logP t , respectively; a and (3 correspond to the intercept and the 
precursor's coefficient, while e a and ep are the associated standard errors. For each regression 
we report the R squared statistics and the significance level for an F test of the null that 
each coefficient is zero. Being the dependent variable a multi period return, we have corrected 
for autocorrelation effects employing the Newey-West correction, but we have not adjusted 
results for heteroscedasticity. All computational tasks have been performed with freely available 
software from the R project, http://www.r-project.org/. The table and figures show four relevant 
empirical facts: a) the striking ability of the log earning over price ratio to predict returns of 
the index, with an increasing R squared the longer the time horizon, b) this evidence increases 
when moving from returns to gross returns, c) when moving over different time horizons, the 
regression coefficients keep constant in a statistically robust way, and d) the poorness of the 
prediction when the precursor is adjusted with long term interest rate. In light of the third 
point, we can ultimately explain the meaning of dashed lines in figures 1, 2, and 3 . They 
correspond to linear curves whose coefficient are weighted averages of those obtained for fixed 
horizons, the weights being defined in terms of the R squared and of the standard error of 
each ah and (3h (see section 4 for more details). Tables 2-4 report results for the same analysis 
performed over MSCI Indexes for eleven more countries. We can roughtly group results in three 
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Figure 1: Regression of yield (log P t+ h — log Pt)/h on the explanatory log earning - price 
ratio centered on r t . Solid points: empirical data, solid line: linear regression, dashed line: 
averaged linear regression. Points are organized in chronological order according to the color 
scale ranging from dark blue to red passing through light blue, green, yellow, and orange; labels 
in the top left panel refer to points corresponding to the first month of the specified year. 
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Figure 2: Regression of yield (log P t+ h — log P t ) /h on the explanatory log earning - price ratio. 
Points, lines, labels, and color scale as in figure 1. 
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Figure 3: Regression of yield (1) on the explanatory log earning - price ratio. Points, lines, 
labels, and color scale as in figure 1. 
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Table 1: All values for a, /3, and associated errors are expressed in 10 4 units; p-values of order 
or smaller than 10~ 4 have been left blank. 



main categories: impressive forecasting power with a fraction of explained variance reaching 
levels of order 90% (Belgium, France, Germany, Japan, the Netherlands, United Kingdom); very 
poor forecasting capability and modest or negligible R squared level (Australia and Canada); 
countries (Norway, Sweden, Switzerland) with an overall good performance, but for which the 
fraction of explained variance behaves anomalously for long horizons (see ten and twelve years 
returns for Norway, and twelve years returns for Sweden and Switzerland). The increase of R 
squared induced by gross returns is also confirmed, even though this evidence is less strong 
than for the case of US data. It is worth to stress that the difference of sample size between 
US and remaining countries data surely plays a role which is however hard to quantify. It is 
far from the aim of this work to provide an economic explanation of the behaviour of Australia 
and Canada. In the next section we will introduce a dynamical model able to reproduce the 
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empirical evidences shared by the majority of countries we have just analyzed. 



3 The model 

In the following we will prove some results concerning a dynamics driven both by a value 
investment and a momentum component. In the first instance we will consider only the role 
played by stock prices, but later we will take into account gross returns and the effect of 
dividends. Our model rests on the following two assumptions inspired by the evidences identified 
in the previous section. 

Assumption 1 Financial returns X t are naturally driven by the growth rate fi t , and by an 
ancillary process £ t satisfying 

6+i = 6 + T^^xW t x , (3) 

1 - 7 

with initial time condition £ = 0. The £ t process ensures the diffusive behaviour of stock prices. 
As will be clear later, the positive parameter ax corresponds to the long run returns volatility, 
while the pref actor depending on k > and < 7 < 1 has been added for future convenience. 
The 's for t = 0, . . . , h are independent identically distributed (i.i.d.) standard Gaussian 
increments. 

Assumption 2 The dynamics of log returns' growth rate is given in terms of the difference 
equation 

fh+i =llk + K [log(e) t 10 - X t + ft + gJ (log(e) J° - log P ) t] + oJJV? (4) 

where the parameter a M is a positive volatility constant, and the Wf 1 's for t = 0, . . . , h are i.i.d. 
standard Gaussian increments, not dependent on the 's. The return growth /i t depends on 
its own value in the previous period, with 7 agents' sensitivity to market trend. This momentum 
effect can be naturally justified in terms of agents ' expectation that returns are higher in bullish 
markets than in bearish ones. The "fundamental" component is proportional to a function 
5F of the level of the logarithmic averaged earnings over price ratio at time zero, where the 
proportionality constant is given by the rate of earnings growth g, implicitly defined by (e)]° = 
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(e)l° exp (gt) . This component grows linearly with time, but we also allow for a correction 
equal to "K, possibly function of the initial time log EP ratio. The initial time value of log EP 
determines the reference growth level, from which the actual stock price may deviate as an effect 
of random external disturbances. Investors reallocate assets in response to this disequilibrium 
causing stock prices to move in the direction that reduces the deviation. The full adjustment 
is not immediate but it takes a typical time /t" 1 the return to mean revert to the fundamental 
level. 

In conclusion, our model states the dynamics of log returns can be fully specified by the linear 
stochastic difference system 

X t+l =X t + Ht + £ t 

< th+i =l^t + K{\og(e)f-X t + 'K + gn) + o- i W? , (5) 

with X = logPo and fj,o initial time conditions. By a further change of variable Y t = X t — 
log(e)o°, the expected values E [F t ], E E [£ t ] conditional at the information available at 
time evolve according to the linear first order difference system 

E [Y t+1 ] =E [y t ]+E H+E [6] 

< Eo[^+i] =-KE [F t ]+7EoN + ^ + ^(l + 3 r )t , 
E [6+i] =E [6] 

with the initial conditions 

E [Fo] =X -log(e)i° 

< E [/x ] = A*o 
EoN =0 

v 

Lemma 1. The expected rate of return is linear in 3 r (\og(e)l° — logPo); provided we consider 
a sufficiently large number of periods h 

= g(l + 3) + 0(h. (6) 



E 



1 , Po+h 
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Moreover its variance converges to zero with rate of convergence equal to minus one 



Vara 



1 , Po+h 



a 



Proof. The stochastic dynamical system (5) may be put in the vector form 



(7) 



K 



Vt+i = JVt + k'K e 2 + Kg(l + 3)t e 2 + a^Wfa + ——a x W^e 3 



1-7 



(8) 



where J 



111 
—k 7 
1 



is the matrix of the systems, for i — 1,2,3 is the orthonormal 



canonical base, and V+ 



Y t 



k and 7, and two ones more 



. The set of the eigenvalues of J includes 1, independently of 



K = 2 4^ + ^V / (1-7) 2 -4k, 
A- = ^-^/(l-7) 2 -4«. 

In order to ensure |A + | < 1 and |A_| < 1, we need to impose the following restrictions 

(1"7) 2 



< 7 < 1 , and < « < 



4 



(9) 



Iterating (8) we obtain 

V h = f l V + K[g(l + ?)(h-l) + •K} S ^te 2 - ng{l + 5) ^ fcj fe e 2 



h-l 



h-l 



h-l 



k=0 

h-l 



k=0 



fc=0 



fc=0 



We can rewrite more conveniently 



1 by means of the matrices A 



1 
A+ 
A_ 
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with 



-7 1 1 
-k A + - 1 A_ - 1 

K 



and 



A_ - A+ 

k(A_-1) -re (l-7)(l-A_)-« 
re(l-A+) re «-(l-7)(l-A+) 



re(A_ — A + ). Taking expectation of e\V h we obtain 



h-l 



h-l 



+k [g(l +T)(h-1)+M\Y^ e^QA fc Q _1 e 2 - ng{l + J) ^ fce^QA^Q-^ 



fc=0 



A 



[(1-A_)F Q + H 



A'j 



A_ - A+ LV ~ 7 u ^ UJ A_ - A+ 
-0(1 + 30 (ft - 1) + K - [#(1 + 3") - IK] 

1 - A ft 



fc=0 

[(i - A + )r + H 

At 



A_ - A^ V 1 — A 



A'j 



1 - A_ 



-9(1 + 2)- 



A_ 



1-A. 



A_ - A 

From previous expression relation (6) immediately follows. 
As far as the variance of Yh is concerned, we have 



1- A. 



(10) 



h-l 



Var [Y h \ = 



k=0 



1 e 2 ) 2 + 7r ^4(e t 1 QA fe Q- 1 e 3 ) : 



(11) 



from which we find 



Var [Y h 



h-1 



(A_ - A. 



£[A*-Ay 



fc=0 



(7 



A' 



(A- - x+y 



h-l 

£ 

A:=0 



A_ - A + + A;(l-A_ 



1-7- 



+ A^( 



1-7 



1 + A. 



and the thesis (7) follows. 



□ 



It is a posteriori clear why we have fixed the proportionality constant of the fundamental 
component equal to g. Indeed, in case g was zero, not only averaged earnings but also stock 
returns would grow sub-exponentially, which is quite reasonable. The variance of Y^ increases 
linearly with h, which is a diffusive behavior of stock returns any realistic model should satisfy 
at the leading order. Finer effects, like possible return non zero autocorrelations induced by 
business cycles over long horizons could be included in our model modifing the covariance 
structure of the {W^} noise. Moreover it is worth to remark that none of our result crucially 
rest on the Gaussian nature of noise increments. 
From previous Lemma, the interesting result we state below follows. 
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Corollary 2. If 5F zs linear in \og(e)l° — logPo? i/ie model (5) is able to reproduce the 

linear scaling of returns with the initial log EP ratio on the long run. 

The next step forward is to add a dividend components to our stock returns model. In this 
respect we mainly follow the proposal discussed in Campbell and Shiller (1988a,b), where it is 
argued that the log dividend price ratio follows a stationary stochastic process, and the fixed 
mean of \ogD t — \ogP t) log 9, can be used as an expansion point 

A(dt_! - X t ) = -9(d t ^ -X t - log 9) + a d W t d , (12) 

with d t = \ogD t , a d > 0, and {W 7 ^} for t = 1, . . . , h i.i.d. standard Gaussian variates. At 
variance with the proposal of Campbell and Shiller, we also assume that 9 can possibly depend 
on the log EP at time zero, i.e. 9 = 9(log(e)Q° — logP ). 

Lemma 3. In the limit h^$> 1 the quantity ^ Yli=o 1°S + con ^ r ^ u ^ es to the rate of log 

gross returns proportionally to 9, and its variance converges to zero with rate minus one. 

Proof. On a monthly basis the quantity D t /P t+ i is of order 10 -3 , and we are allowed to replace 
log ^1 + jrj^J with Di/Pi + i. In the same spirit of Campbell and Shiller (1988a,b), in order to 
prove the thesis we need to Taylor expand quantities of interest around log 9- In particular, we 
have 

= exp (di - X i+l ) ~ exp (log 9) (1 + d { - X i+l - log 9) . 

Iterating equation (12) and taking expectation, we obtain for % ^ 

Eo[di_i - X,} = (1 - - X ) + log 9 [1 - (1 - BY] , (13) 

where d_i is the log dividend prevailing at time zero being cumulated between time -1 and 
zero. Under the constraint < 9 < 2, we can conclude 

1. 



h 



"- 1 ' Di 



II— L / 

E lo s 1 

t=0 ^ 



— , g(i-iog9) + giog9 + gi-^(iogpp -iog9) 1 (1 e)h 



19 



and 

h-l 



Var 



s5> 



li^ ° Pi 

i=0 



A 



i+l 



1 ^ + /> 



h0{2-6) \h 



□ 



The expansion performed in the above proof is slightly different from that proposed in Camp- 
bell and Shiller (1988a,b), where log(Pt + D t ) is approximated by $ log Pt + (1 — i?) log A + x 
with = 1/(1 + 3) and x = log(l + 3) — $S log 3- In the same spirit of the fourth point in the 
Appendix of Campbell and Shiller (1988a), in the next section we will evaluate the impact of 
the approximation on the quality of regressions. 

Corollary 4. If 3 is linear in log(e)o° — logPo, than equation (12) is able to reproduce the 
linear scaling of dividends' contribution to the yield with the initial log EP ratio on the long 
run. 

We are now ready to state the main theoretical result of this paper. 

Preposition 5. The expected gross yield (1) is linear in J and 3, provided we consider suffi- 
ciently long horizons 



Eofe/h] ~g{l + ?) + 9 + 



The proof is an immediate consequence of the Corollaries 2 and 4. More interestingly, from 
the proofs of previous Lemmas we can explicitely identify the contribution to the long term yield 
whose scaling over time is more persistent, and which ultimately determines its convergence 
towards the limit #(1 + jF) + 3- The expression of the leading correction proportional to one 
over h reads 



1-A_A_ 

1 + k- 



(1 - A_) 2 (l - A + )2 



log£P 4 3 ( 1-^) (logDPn-logS). (11) 



From equation (13) we know that (1 — 9) 1 plays the role of a damping function. Assuming 
<C 1, it reduces to exp(— i/r e ) where the quantity r e = 1/9 plays the role of the typical 
time scale of the mean-reverting process. We reasonably expect that the last term in the 
expression (14) does not contribute too much to the leading correction, since we neither expect 
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an extremely large time scale for the process nor an extreme discrepancy between log DPq and 
log 9- Another interesting point to note is the choice of /xo plays a minor role in relation to 
the speed of convergence of the process to the long term expected value. Indeed, /i does not 
appear in expression (14), while from (10) we know that its effect is exponentially damped by 
A fe and A+. Being < A_ < A + < 1, the dominating contribution for h ^> 1 is given by Ait, 
implying a typical damping scale equal to — (logA + ) _1 . 



4 Parameters calibration 

Corollaries 2 and 4 state that the linear scaling of long term yield with the level of log EP ratio 
can be reproduced by our dynamical model if we assume for the quantities g(l + $) and 9 the 
form g{l + !?) = a$ + /^logEPo and 9 = «g + /?g logEP . This suggest an immediate and 
easy way to fix the values of ay, (3j, ag, and /3g. In Tables 2 - 4 we have reported the values of 
ah, e ah , (3h, and ep h , with the index h — 24 x i for i = 1, . . . , 8 corresponding to the number of 
elapsed months, obtained from the regression of (log P t+ h — logP t ) /h on log(e)J° — logP t . The 
values for a? and e aiJ are then computed by 



2 



ay = ^2,w h a h with w h = _ ; an d e ^ = ^ w, ■ . 

h €ah z2h^ R h h 

and similarly for (3j and the associated error. The weights have been chosen in order not only 
to privilege regression's coefficients with the lowest relative error e ah /\cth\, but also those with 
the highest R squared. The quantities labelled with g are obtained in the same way after 
regressing the dividends' contribution to the yield | Yli=o 1°S y- + f^tf ) on ^ e ^ n ^^ a ^ 1°§ 
EP. Equipped with the values of ag, and /3g, see Table 5, in Table 6 we report the com- 



parison between the regression of ^E Yli=o 1°S fl + 
SiE [EtJ (l + log A) 



and the approximate expression 



on information provided by the log EP ratio. We explicitly de- 
tail cases of Belgium, France, United Kingdom and Germany, but all other countries perform 
exactly in the same way. Results are in full statistical agreement, with a strong level of sig- 
nificance. There are small differences, but they do not invalidate the overall picture. Once 
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Figure 4: Monte Carlo scenarios generated simulating the model (5)- (12) with inital time 
conditions Y , /x , and log DP equal to the empirical ones; the solid line corresponds to the 
linear regression with coefficient as in Table 5. Color scale as in Figures 1-3. 



22 





9(1 + 


3) = 


a s + #rlog EP 


9 = 


a s + p g log EP 






a 7 






a s 


h 


Australia 


-18 


±30 


-12 ±6 


61 


±2 


6.2 ±0.4 


Belgium 


510 


± 16 


87 ±3 


138 


±5 


20 ± 1 


Canada 


-142 


±20 


-35 ±4 


83 


±2 


11.7 ±0.4 


France 


356 


± 11 


55 ±2 


43 


± 1 


3.7 ±0.2 


Germany 


549 


± 19 


90 ±4 


55 


±2 


5.9 ±0.4 


Japan 


581 


±23 


94 ±4 


13 


± 1 


1.0 ±0.2 


Netherlands 


504 


± 14 


85 ±3 


78 


±2 


9.8 ±0.4 


Norway 


603 


±51 


110 ±10 


45 


±4 


5.6 ±0.8 


Sweden 


578 


±35 


93 ±6 


34 


± 1 


3.2 ±0.3 


Switzerland 


322 


±22 


49 ±4 


47 


± 1 


5.9 ±0.2 


United Kingdom 


273 


± 15 


48 ±3 


69 


±2 


7.6 ±0.4 


United States 


295 


±14 


54 ±3 


127 


±3 


17.7 ±0.6 



Table 5: All values are expressed in 10 4 units. 

the linear functions J and S have been estimated, the next relevant parameter to fix is the 
rate of exponential growth of averaged earnings g. This can be easily obtained iteratively, first 
regressing a vector of 196 monthly logarithmic averages on a linear function of h, then rolling 
the window and repeating the estimate from January 1881 until the last admissible date. In 
this way we obtain a sample of g values from which empirical 68% confidence intervals can be 
computed, for the results on a monthly basis from Shiller's data series for the US market please 
refer to Table 7. The value of 9 and <7d are estimated regressing A(d t -i— X t ) on d t ~i — X t — log S, 
employing the same technique of rolling windows discussed above. The measured mean value 
for 9 is equal to 0.0207, and following the consideration written at the end of the previous 
section this value implies a typical scale for mean-reversion of order 48 months. The param- 
eters of the \i t process are also estimated by means of linear regression of ji t +i on fi t , and on 
log(e)J° — Xt + "K + gJt. In the first instance this require to introduce a suitable proxy for 
the unobservable variable fif Our approach has been to estimate it by means of X t — Xt-i, 
but different choices are admissible, for example in terms of an exponential weighted average 
of past price returns. However in this latter case an undesirable dependence of the parameters 
on the value of the exponential weight would be introduced. Moreover "K is still an unknown 
quantity that we have to choose before performing the optimization procedure. Recalling ex- 
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24 



9 (xlO" 4 ) 


e (xlO- 4 ) 


7 


K (XlO" 4 ) 


^(xlO- 4 ) 


^(xlO- 4 ) 


a\ (xlO" 4 ) 


12 (-3,31) 


207 (62,352) 


0.26 (0.19,0.32) 


201 (68,323) 


13 (11,26) 


12 (10,21) 


18 (17,19) 



Table 7: Estimated values of the parameters from Shiller's time series and 68% confidence 
intervals. 

pression (14), it is reasonable to assume it linear in the log EP ratio IK = + (3^ log EP 
and to initialize the procedure with arbitrary values for the linear coefficients. The numerical 
process then produces optimal estimates of 7 and k, for suitable and (3jt minimizing the 
leading correction term (14). In particular, for a<yi = 1.10 and 0^ = —0.79 we have found 
7 = 0.26 and k = 0.02 both satisfying the constraint 4k ^ (1 — r y) 2 . For the estimate of ax we 
approach the problem from a different point of view. Indeed from equation (7) we know that 
the variance of (Xh — Xo)/h scales as one over h with proportionality constant equal to a\. 
Estimating the variance of price returns over h months for h ranging from 24 until 196, and 
fitting the curve with a linear relation we have found the value of 0.18% reported in Table 7. 
Ultimately, we test the goodness of the estimated parameters and in Figure 4 we present the 
results of a Monte Carlo simulation of the model (5)- (12). Each point corresponds to a single 
realization of yt.h with h = 24, . . . , 196 months with t starting from January 1881. The initial 
time values of Y , /i , and log -DP are fixed equal to the empirical ones. We have also plotted 
the linear relation between yields and the log EP predictor with coefficients specified in Table 5. 
The numerical test confirms the ability of the proposed model to capture the relevant features 
of the historical time series, in particular the shrinking of the data cloud to the expected value 
with a scaling exponent dominated by the diffusive component of the price dynamics. 

5 Conclusion and Perspectives 

This paper documents substantial predictability of stock index returns on the basis of a simple 
valuation metric: the cyclically adjusted price-earnings ratio. 

Practitioners and academicians alike have been using several valuation measures for esti- 
mating the intrinsic value of a stock index: for example in table 2 of Poterba and Samwick 
(1995) the ratio of market value of corporate stock to GDP, the year-end price-to-earnings ratio, 
the year-end price-to-dividend ratio and Tobin's q are reported from 1947 to 1995 in an effort 
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of alerting the reader on the possible overvaluation of the index 5 . In particular Tobin's q has 
been proposed as another efficient method of measuring the value of the stock market, with 
an efficiency comparable to the CAPE Smithers (2009). The q ratio is the ratio of price to 
net worth at replacement cost rather than the historic or book cost of companies. It therefore 
allows for the impact of inflation, much alike the CAPE which averages real earnings over a 
ten year span. It would be interesting to carry out an empirical analysis of the relationship 
between Tobin's q and future stock index returns also for countries other than the U.S., but we 
have not been able to find high quality long-term time series of the g-ratio outside the U.S.. 

Substantial evidence of the importance of fundamentals in the valuation of international 
stock markets has been accumulated by the proponents of fundamental indexation Arnott 
et al. (2005). 

Clearly our empirical analysis should be extended so as to cover longer historical periods. 
Longer time-series would also be important so as to investigate possible "evolutionary" phe- 
nomena like the end of the relationship between market valuation and interest rates, which 
may perhaps be interpreted as an example of the adaptive market hypothesis Farmer and Lo 
(1999), Lo (2004). 
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